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TECHNICAL FIELD 

The present invention relates generally to data analysis, and more particularly to 
systems and methods for improving collaborative filtering. 



BACKGROUND OF THE INVENTION 

The use of data analysis tools has increased dramatically as society has become 
10 more dependent on digital information storage. In e-commerce and other Intemet and 
non-Intemet applications, databases are generated and maintained that have 
astronomically large amounts of information. Such information is typically analyzed, or 
"mined," to learn additional information regarding customers, users, products, etc. This 
information allows businesses and other users to better implement their products and/or 
15 ideas. 

Electronic commerce has pervaded almost every conceivable type of business. 
People have come to expect that their favorite stores not only have brick and mortar 
business locations, but that they can also be accessed "online," typically via the Internet's 
World Wide Web. The Web allows customers to view graphical representations of a 

20 business' store and products. Ease of use from the home and convenient purchasing 
methods, typically lead to increased sales. Buyers enjoy the freedom of being able to 
comparison shop without spending time and money to drive from store to store. 

Online commerce has continuously developed to bring a more enjoyable buying 
experience to online buyers. Often, websites require a "log in" and/or utilize a "cookie" 

25 to track which buyer is looking at their website. With this information, a business can 

track purchase parameters such as type, size, quantity, and purchasing frequency. This is 
valuable information because it allows a company to forecast fixture sales and to 
determine what goods are of the most interest to online buyers. Typically, however, 
people are individual in nature and each person tends to have slightly different likes and 

30 dislikes. For example, a company who sells a lot of cellophane tape online might assume 
that their buyers are utiUzing it for craft project building purposes. Since the company 
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also sells colored glitter, they may include an advertisement for glitter next to their tape 
advertisement on their website. In actuality, however, most of the customers are 
^ purchasing the tape for business office use, and the glitter advertisement may even turn 

some customers away due to the fact that the company seems to not understand its 
5 customer's needs correctly. The glitter advertisement could then even lead to decreased 
tape sales. Had the company, instead, offered staples and/or paper clips along with the 
tape, they might have seen increased sales for all of their products as buyers might now 
perceive their store as a "one-stop shop" for all of their business office supply needs. 

Pairing up items for selUng is often known as "associative selling." An effort is 

10 made to correlate various items/products based upon a particular buyer's past buying 
habits and/or the past buying habits of other buyers who purchased similar items in the 
past. This associative process can also be expanded beyond direct product sales. It can 
be utihzed indirectly to enhance sales such as with television viewing habits. A 
television company can predict that most viewers of show X are men who prefer rugged 

15 sports such as football, extreme mountaineering, and rugby. This would give the 

television company a good idea that programming an opera or ballet in this time slot 
would probably reduce their viewer ratings. Even the existing show could be "enhanced" 
with more rugged content to increase the size of show X's audience. A successful show 
with a large audience naturally draws advertisers who want to reach more of their market. 

20 Thus, the viewing habits can even be used to provide appropriate commercials that have a 
high audience acceptance rate for a particular genre of viewers. 

Prior to the advent of online selling, a salesperson would typically approach a 
customer and ask them a series of questions to better understand their likes and dislikes 
along with their prior purchasing habits. Through this interaction, the salesperson is able 

25 to determine suggestions for products this particular customer might like. This same type 
of "associative selling" is also just as important to online merchants. However, online 
there is not a salesperson to "size up" a customer and determine their needs and wants. 
Instead, programs are utilized to determine suggestions for online buyers when they visit 
a business' website. For example, consider an online buyer who previously bought a dog 

30 bowl and a dog bone. ProbabiUties can be determined that show that it is likely that this 
person owns a dog. The person might, therefore, be interested in dog related items such 
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as dog collars, leashes, and brashes. Since these items are brought to the attention of the 
buyer, if it matches their needs, they are more likely to purchase those items than, for 
instance, an advertisement for catnip or a bird feeder. 

Although associative type selling is extremely advantageous, it is also generally 
5 very difficult to actually determine product associations. This is generally due to . 

complex computing requirements, difficulty in accessing and retrieving the necessary 
information, and/or long computational calculation times. If a method is inaccurate, it 
can possibly drive customers away, causing losses in sales. A man who bought his wife a 
toaster oven and pajamas for her birthday does not necessarily want to be constantly 

10 bombarded with hair curler and beauty aid advertisements. Just Uke with a good 

salesperson, correctly associated products can lead to increased sales, while, like a bad 
salesperson, incorrectly associated products may cause a decrease in sales. Therefore, it 
is important to have an accurate means to associate various products/items for diverse 
individuals. This includes those with esoteric tastes who visit a website only once in a 

1 5 great while, along with those who have more traditional tastes and buy frequently from 
the same website. 

Techniques that attempt to determine preferences of a user are known as 
collaborative filtering. A collaborative filtering system can produce recommendations by 
determining similarities between one user and other users. The value of this type of 
20 information to society increases daily as we move towards an electronic oriented 

environment where our preferences can be easily disseminated to us by any nxmiber of 
means such as computers, televisions, satellite radios, and other devices that lend 
themselves to the potential of having interactivity with a user. 



25 SUMMARY OF THE INVENTION 

The following presents a simplified summary of the invention in order to provide 
a basic understanding of some aspects of the invention. This summary is not an extensive 
overview of the invention. It is not intended to identify key/critical elements of the 
invention nor to delineate the scope of the invention. Its sole purpose is to present some 
30 concepts of the invention in a simpUfied form as a prelude to the more detailed 
description that is presented later. 
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The present invention relates generally to data analysis, and more particularly to 
systems and methods for improving collaborative filtering (CF) such as memory and/or 
model based CF and the like. Statistical smoothing methods are leveraged to quickly 
create models that can efficiently predict the probabiUty that a user Ukes an item and/or 
5 similarities between items. By improving collaborative filtering, dramatic increases in 
performance are obtainable in product-to-product reconmiendations. This facihtates in 
simplifying user interfaces and increasing user satisfaction with products/systems that 
employ the present invention. The present invention also provides improvements over 
systems based on dependency nets (DN) in both areas of quality of recommendations and 

10 speed of model creation. It can also be complementary to DN to improve the value of an 
existing collaborative filtering system's overall efficiency. 

The present invention also facilitates data analysis by providing a means to create 
a collaborative filtering system that is computationally efficient and utilizes a minimal 
amount of memory. This allows a CF system to reside within devices that have low 

15 computational power and small memories. Servers will benefit firom being able to readily 
provide recommendations quickly and more accurately while television set-top boxes 
with minimal memory will be able to likewise make recommendations that were 
previously restricted due to computational and memory requirements. This flexibility 
drastically increases the usefulness of collaborative filtering and allows more users to 

20 integrate CF into their businesses and also products, bringing a more user-friendly 
experience for customers. 

To the accomplishment of the foregoing and related ends, certain illustrative 
aspects of the invention are described herein in connection with the following description 
and the annexed drawings. These aspects are indicative, however, of but a few of the 

25 various ways in which the principles of the invention may be employed and the present 
invention is intended to include all such aspects and their equivalents. Other advantages 
and novel features of the invention may become apparent fi'om the following detailed 
description of the invention when considered in conjunction with the drawings. 
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BRIEF DESCMPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of an analysis system in accordance with an aspect of 
the present invention. 

FIG. 2 is a block diagram of a collaborative filtering system in accordance with an 
5 aspect of the present invention. 

FIG. 3 is a block diagram of a measure of association selection system in 
accordance with an aspect of the present invention. 

FIG. 4 is a block diagram of a smoothing system in accordance with an aspect of 
the present invention. 

10 FIG. 5 is a flow diagram of a method for data analysis in accordance with an 

aspect of the present invention. 

FIG. 6 is a flow diagram of a method for collaborative filtering in accordance 
with an aspect of the present invention. 

FIG. 7 is a flow diagram of a method for choosing measures of association for 
15 collaborative filtering in accordance with an aspect of the present invention. 

FIG. 8 is a flow diagram of a method for smoothing maximum likelihood 
estimators (MLE) estimates for collaborative filtering in accordance with an aspect of the 
present invention. 

FIG. 9 is a flow diagram of a method for scoring higher-order item sets for 
20 collaborative filtering in accordance with an aspect of the present invention. 

FIG. 10 illustrates an example operating environment in which the present 
invention can fimction. 

FIG. 1 1 illustrates another example operating environment in which the present 
invention can function. 

25 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is now described with reference to the drawings, wherein 
like reference numerals are used to refer to like elements throughout. In the following 
description, for purposes of explanation, numerous specific details are set forth in order 
30 to provide a thorough imderstanding of the present invention. It may be evident, 

however, that the present invention may be practiced without these specific details. In 
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other instances, well-known structures and devices are shown in block diagram form in 
order to facilitate describing the present invention. 

As used in this application, the term "component" is intended to refer to a 
computer-related entity, either hardware, a combination of hardware and software, 
5 software, or software in execution. For example, a component may be, but is not limited 
to being, a process running on a processor, a processor, an object, an executable, a thread 
of execution, a program, and/or a computer. By way of illustration, both an application 
running on a server and the server can be a computer component. One or more 
components may reside within a process and/or thread of execution and a component 

10 may be localized on one computer and/or distributed between two or more computers. A 
"thread" is the entity within a process that the operating system kernel schedules for 
execution. As is well known in the art, each thread has an associated "context" which is 
the volatile data associated with the execution of the thread. A thread's context includes 
the contents of system registers and the virtual address belonging to the thread's process. 

15 Thus, the actual data comprising a thread's context varies as it executes. 

The present invention provides improvements to systems and methods for 
collaborative filtering. The improvements include new algorithms for collaborative 
filtering. The algorithms are based on association rules with various methods of 
statistical smoothing {maximum likelihood/support and expected values/priors). They 

20 ' quickly create models that can efficiently predict both the probability that a user will like 
an item and/or the similarity between items. 

Collaborative filtering (CF) is the task of predicting user preferences over items 
such as books, television shows, movies, and web pages. Collaborative filtering helps 
drive sales at Internet commerce sites such as book, music, and clothes retailers and the 

25 like. The present invention dramatically improves the recommendations available to 
customers in several important scenarios. This invention helps sites utilizing online 
processing tools compete with large complex online sites by improving the quality of a 
seller's product-to-product recommendations. CF is also a central feature for digital 
media systems such as digital television recording systems. The present invention 

30 improves digital media offerings by simplifying the user interface and increasing user 
satisfaction. It is memory efficient enough for deployment on set-top boxes and other 
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small memory/processor devices and the like. It is also computationally efficient enough 
for deployment on servers where processing time is critical 

Early CF technology made good recommendations but did not run fast enough for 
broad utilization on servers. These processes were also too memory intensive for many 
5 television applications. Second generation technology, utiUzing dependency nets (DN), 
was computationally fast and more memory efiRcient. However, in some scenarios, 
model creation was slow and the quality of its recommendations became poor. The 
present invention speeds up model creation and improves the quality of recommendations 
in scenarios for which DN is weak. It matches DN's speed, and yet, is an extremely 

10 memory efficient method, making it suitable for deployment on even low memory set-top 
boxes and the like. It can also be combined with DN, creating a fast system that gives 
good recommendations over a broad range of scenarios. 

In FIG. 1, a block diagram of an analysis system 100 in accordance with an aspect 
of the present invention is shown. The analysis system 100 is comprised of an analysis 

15 system component 102. In this instance of the present invention, the analysis system 

component is comprised of a collaborative filtering system component 104. Data 106 is 
input into the system 100 and processed by the collaborative filtering system component 
104 to produce a score 108 for the data. The score 108 represents a value of a measure of 
association for the input data 106. In this manner, multiple data can be input to provide a 

20 list representative of various scoring values. A higher value can indicate a more 

preferential association than a lower score value. Subsequent associations can also be 
derived fi-om the list based upon the computed scores. As an example, several television 
show preferences can be input as data. A resulting scoring list might show several scores 
in close proximity. It can be assumed that a television viewer who liked one of the shows 

25 in that grouping might also like the other shows in the grouping. 

Turning to FIG. 2, a block diagram of a collaborative filtering system 200 in 
accordance with an aspect of the present invention is illustrated. The collaborative 
filtering system 200 is comprised of a collaborative filtering system component 202 with 
input data 212 and a resulting score 214. The collaborative filtering system component 

30 202 is comprised of a measure of association (MOA) computing component 204, a 
measure of association selection component 206, a smoothing component 208, and a 
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maximum value analysis component 210. The MO A selection component 206 selects a 
desired or proper measure of association that is utilized by the MOA computing 
component 204 to determine a measure of association for the input data 212. Techniques 
and details of the MOA selection component 206 are described infra. The smoothing 
5 component 208 facilitates in smoothing out maximum hkelihood estimator (MLE) 
estimates that are utilized by the MOA computing component 204. Details and 
techniques with regard to this component are discussed further infra. The maximum 
value analysis component 210 determines an appropriate score for a measure of 
association when multiple scores are available due to multiple measures of association 

10 rules being applicable to the input data 212. A maximum value among the rules can be 

established and utilized as the output resulting score 214. Techniques for determining the 
maximum value are detailed infra. 

Referring to FIG. 3, a block diagram of a measures-of-association selection 
system 300 in accordance with an aspect of the present invention is illustrated. The 

15 MOA selection system 300 is comprised of an MOA selection component 302 and an 
MOA computing component 322. The MOA selection component 302 determines a 
measure of association to be utilized by the MOA computing component 322. It 302 is 
comprised of an MOA selector 304, a combined MOA 306, a probable items MOA 308, a 
similar items MOA 310, and additional MOA 312. Each entity 306-3 12 can be 

20 comprised of multiple like entities. The probable items MOA 308 utilizes a probabiUty 
algorithm, for example P(/ 1 3 14, as a basis for its measure of association. The similar 
items MOA 310 utilizes a similarity algorithm for its measure of association. Examples 
of similarity algorithms include Weight of Evidence (WOE) algorithms 316 and/or Lift 
algorithms 318 and the like. Other algorithms 320 can be employed as a basis for 

25 measures of association for the additional MOA 312. The combined MOA 306 utilizes 
algorithms 318 to provide a measure of association that can include such aspects as 
probability and similarity. The algorithms 318 can include, for example, algorithms 314- 
318 utilized by other MOA's 308-310. Other unique algorithms can also be employed as 
represented by Other algorithms 320. The MOA selector 304 selects an appropriate 

30 and/or desired measure of association that is to be employed by the MOA computing 
component 322. 
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Moving on to FIG. 4, a block diagram of a smoothing system 400 in accordance 
with an aspect of the present invention is shown. The smoothing system 400 is 
comprised of a smoothing component 402 and an MOA computing component 426. The 
smoothing component 402 facilitates the MOA computing component 426 by providing 
5 smoothing of MLE estimates employed in scoring data. The smoothing component 402 
is comprised of a smoothing selector 404, a support cutoff smoother 406, a prior on MOA 
smoother 408, a prior on counts smoother 410, and an additional smoother 412. Each 
entity 406-412 can be comprised of multiple like entities. The support cutoff smoother 
406 utilizes a support level 414 (i.e., threshold) to provide smoothing. If a particular 

10 entity/item does not have adequate support, it is rejected. This is elaborated infra. The 
prior on MOA smoother 408 utilizes prior knowledge such as, for example, a posterior 
probability distribution 416 and/or an asymptotic standard error (ASE) 418, to provide 
smoothing. Details are described The prior on counts smoother 410 employs an 
added count such as, for example, a single count "r" and/or a multiple count "r," to 

15 provide smoothing. This is discussed further m^a. The additional smoother 412 can 
incorporate other techniques 424 that can include, for instance, combinations of the 
named smoothers 406-410 and/or additional smoothers that provide smoothing based on 
combinations of other smoothing techniques 414-422 and/or additional smoothing 
techniques. 

20 Comprehension of the present invention can be faciUtated by exploring in detail 

how entities mentioned supra operate and interrelate. The present invention employs 
pairwise association rules for collaborative filtering. Pairwise association rules are a well 
known way of representing the idea that interest in one item may indicate interest in 
another. For example, the following association rule: 

25 

bread peanut butter | 60% (1) 

represents that **the probabiHty of peanut butter given bread is 60%." Thus, people who 
have or buy peanut butter have a 60 percent likelihood of also having or wanting bread 
30 (most likely so that they can make a sandwich). In this fashion, sets of items can be 
compared to estabhsh associative rules that can be applied to future data. 



9 



MS 180583.1 



10 



The present invention creates a collaborative filtering system from association 
rules by first choosing appropriate measures of association (MO A). The association 
between two items can be measured many ways. When it is desirable for a CF system to 
recommend probable items then a conditional probability based MOA can be appropriate. 
For example, for the association rule: 

I (2) 

a measure of association can be given by: 

P{I\E) (3) 



which is the probability of / given When it is desirable for a CF system to recommend 
similar items, Weight of Evidence (WOE) and/or Lift can be more appropriate. Weight 
15 of Evidence from to / is given by: 



Ir.™ (4) 
P{E\I) 



20 



and Lift is given by: 

P{I\E) 
Pil) 



(5) 



Other measures of association are also possible including those that give weight to both 
similarity and general popularity. 
25 Once an appropriate or desirable measure of association has been established, the 

measures of association are computed from data input into a collaborative filtering 
system. A naive way to compute measure of association is with a maximum-likelihood 
estimator (MLE) that can be easily determined from the data. The computed measure of 
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association becomes a score of an item /. Items can then be sorted by their scores to 
create a recommendation list. 

In CF systems it is often the case that multiple association rules can imply the 
same item /. One method of the present invention to get a single score for item /, when 
5 multiple items imply it, is to give it a maximum value determined from rules that imply 
the item. 

A well-known generalization of pairwise association rules is to have more than 
one item on a left-hand side (LHS) of the rule. For example: 

10 bread, jelly -> peanut butter 1 55% (6) 



indicates that "the probability of peanut butter given bread and jolly is 55%." 

When applied to collaborative filtering, this introduces an issue of how to 
combine effects of multiple rules that have overlapping left-hand sides and a same right- 
15 hand side (RHS). For example, consider two rules that overlap such as: 



E1,E2-^ 1 1 60% 

and 

El 1 1 55%. 

20 

One way to solve the issue is to give an item a maximum value of the rules that imply it, 
even when those mles have overlapping left-hand sides. This method works especially 
well with measures of association in which more specific (longer) rules tend to have 
stronger associations, as is the case with Lift and Weight of Evidence, but not 

25 Conditional Probability. 

A problem when using maximum-likelihood estimators (MLE) when computing a 
measure of association is that a measure can be overly sensitive to particular observed 
counts. For example, if only one person in a data set bought bread and that person also 
bought peanut better, the MLE for the conditional probability score for peanut butter 

30 given bread is 1, the Lift score is also 1, and the Weight of Evidence score is infinity. In 
all three cases, they represent the most extreme scores achievable. Statistical techniques 



(7) 
(8) 
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for computing measures that are less sensitive to coincidence than MLE are called 

smoothing. 

One approach to achieve smoothing with the present invention is smoothing with 
support cutoffs, hi this approach, only rules of association with a minimum level of 
5 support are considered. Support, in this case, is the number of users in a data set who 
expressed interest in all products mentioned in a rule. For example, only association 
rules with a support of at least 10 might be considered (this would estabUsh a "threshold" 
of 10 in this example). When a measure of interest employed is Weight of Evidence (and 
in other cases also) another definition of support is appropriate. It is defined, in this 
10 instance, as a minimum of counts of a, b, c, rf, where: 

• a is a number of users who expressed interest in none of the items; 

• is a number of users who expressed interest in an item on a right-hand side of an 
15 association rule, but not in all items of a left-hand side of the association rule; 

• c is a number of users who expressed interest in all items of the left-hand side of 
the association rule, but not in an item on the right-hand side of the association 
rule; and 



20 



is a number of users who expressed interest in all items mentioned in the 
association rule. 



Another, complementary, way to smooth is by adding a pseudo count of r (where 
25 r is a real value greater than 0) to all counts before calculating MLE values. This is 

referred to as smoothing with a prior on counts. For example, if r is J4, then the present 
invention computes an MLE with counts a-^Vi, c+Vi, and d+Vz. More generally, 
different counts can be added to different cells instead of having a single value count. 
The smoothing methods given so far address the problem of coincidence among 
30 the counts for a given association rule. When many association rules are considered, a 
second problem with coincidence appears, namely, as more and more association rules 
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are employed, the expected number of rules with apparently strong association by chance 
increases. 

One method to address this problem employed by the present invention is to put a 
prior probability distribution on some of the uninteresting measures of association. This 
5 is known as smoothing with a prior on a measure of association or "informative priors on 
measures of association." Informative prior is a term of art signifying that background 
knowledge is applied to set a prior probability. This is contrary to an "uninformative 
prior" which typically is a uniform distribution over a range of plausible values. For 
example, suppose that based on prior knowledge it is expected that there is to be no 

10 association in 99% of the pairwise associations considered and no association in 99.9% of 
higher order associations. Moreover, suppose Weight of Evidence is being employed. 
Then utilizing techniques, such as Bayesian methods disclosed in CFW technical report 
MSR-TR-2002-46 (CFW: A Collaborative Filtering System Using Posteriors Over 
Weights of Evidence; Carl M. Kadie, Christopher Meek, and David E. Heckerman), a 

1 5 prior of 99% (or 99.9%) can be put on WOE = 0 and then a posterior probability 

distribution on WOE can be computed. This posterior distribution can be made into a 
score by the methods mentioned in the CFW technical report, id, and/or by computing 
the expected value of the Weight of Evidence. 

Similarly, classical statistical methods can give asymptotic standard errors of the 

20 MLE. The test on the error can be made more difficult as the number of association rules 
to be considered increases. Classical measures (estimates) of association for 2x2 tables 
have been studied for decades. Among the most popular is the cross-product ratio (also 

called the odds ratio), which on table [[^z,6],[c,ti]]is 0 =ad/ be. Other measures include: 
25 • the log of the cross-product ratio; 

• Yule's Q, which is the 2x2 version of the Gamma measure, 

• Phi, which is the 2x2 version of Pearson's correlation coefficient, and 

30 
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• various "tau" measures {see, Reynolds, H.T. (1984); Analysis of Nominal Data, 
2nd Edition; Sage Publications; Newbury Park, CA and Gibbons, J.D. (1993); 
Nonparametric Measures of Association-, Sage Publications; Newbury Park, CA. 

5 The accuracy of these measures of association can be characterized with the asymptotic 
standard error (ASE) of the estimator. For example, the standard error for the log of the 
cross-product ratio {see, Agresti, A. (1990); Categorical Data Analysis; John Wiley & 
Sons, New York) is given by: 

10 se{loge) = (1/a + l/b -f l/c + Vd)^^^ . (9) 

The standard error can then be used to estimate a confidence interval due to the 

asymptotic normality of the estimator log^ . 

In view of the exemplary systems shown and described above, methodologies that 

15 may be implemented in accordance with the present invention will be better appreciated 
with reference to the flow charts of FIGs. 5-9. While, for purposes of simplicity of 
explanation, the methodologies are shown and described as a series of blocks, it is to be 
understood and appreciated that the present invention is not limited by the order of the 
blocks, as some blocks may, in accordance with the present invention, occur in different 

20 orders and/or concurrently with other blocks firom that shown and described herein. 

Moreover, not all illustrated blocks may be required to implement the methodologies in 
accordance with the present invention. 

The invention may be described in the general context of computer-executable 
instructions, such as program modules, executed by one or more components. Generally, 

25 program modules include routines, programs, objects, data structures, etc, that perform 

particular tasks or implement particular abstract data types. Typically the functionality of 
the program modules may be combined or distributed as desired in various embodiments. 

In FIG. 5, a flow diagram of a method 500 for data analysis in accordance with an 
aspect of the present invention is depicted. This method 500 is an overall flow 

30 illustrating how a score is computed for data inputted into a data analysis system. The 
method 500 starts 502 by creating a collaborative filtering system based on association 
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rules 504. The association rules can be based on probability and/or similarity based 
rules. A determination is made as to whether smoothing for maximum likelihood 
estimator (MLE) estimates is desired 506. If yes, the MLE is smoothed utilizing various 
smoothing methods described supra and infra 510. A measure of association score is 
5 then determined 508, ending the flow 512. If smoothing is not desired 506, a measure of 
association score is determined 508, ending the flow 512. 

Turning to FIG. 6, a flow diagram of a method 600 for collaborative filtering in 
accordance with an aspect of the present invention is shown. This method 600 illustrates 
how the present invention's collaborative filtering system utiHzed in a data analysis 

10 system operates at an overview level. The method 600 starts 602 by choosing 

appropriate measures of association 604. These techniques are elaborated both supra and 
infra. The appropriate measure of association is then utilized to compute a measure of 
association score fi*om data input into a system 606. A determination is then made as to 
whether there are multiple rules of association that are applicable to a given piece of data 

15 or item 608. If not, the computed association score is maintained 610, ending the flow 
612. If multiple rules are applicable 608, effects of the multiple rules are combined 614. 
An .optimum value is then obtained fi'om the multiple rules and provided as a measure of 
association score 616, ending the flow 612. 

Moving on to FIG. 7, a flow diagram of a method 700 for choosing measures of 

20 association for collaborative filtering in accordance with an aspect of the present 

invention is illustrated. The method 700 starts 702 by determining if probable items are 
desired 704. If yes, a conditional probability algorithm is employed to determine a score 
for an item based on the likelihood that it will occur 706 and the method 700 continues. 
If probable items are not desired, a determination is made as to whether similar items are 

25 desired 708. If yes, a Weight of Evidence algorithm and/or a Lift algorithm are 

employed to determine an association score 710 and the method 700 continues. If similar 
items are not desired 708, a determination is made as to whether a combination of 
similarity/popularity based items is desired 712. If yes, an algorithm/method is utiUzed 
to weight both similarity and general popularity to determine an association score 714, 

30 ending the flow 716. If similarity/popularity of an item is not desired 712, the flow ends 
716. The present invention can employ these methods md/or algorithms and also other 
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methods and/or algorithms dependent upon desirable characteristics for a measure of 
association score. 

In FIG. 8, a flow diagram of a method 800 for smoothing maximum likelihood 
estimator (MLE) estimates for collaborative filtering in accordance with an aspect of the 
5 present invention are depicted. The method 800 illustrates several methods of smoothing 
in one instance of the present invention. The method 800 starts 802 by making a 
determination as to whether a support cutoff method is desired 804. If yes, a 
determination is then made as to whether a Weight of Evidence method is desired 806. If 
no, a support value or "threshold" is determined based on a number of users who 

10 expressed an interest in products of an association rule 808 and the method 800 

continues. This value is a minimum acceptable count for incorporating effects from that 
particular association. Anything below this threshold is not utilized. If Weight of 
Evidence is desired 806, a minimum acceptable value or threshold is determined based on 
counts of parameters a, c and d as defined supra and the method 800 continues. If 

1 5 support cutoff is not desired 804, a determination is made as to whether a prior on counts 
is desired 812. If yes, pseudo counts are added to determine counts in either via a single 
pseudo count or multiple pseudo counts per cell 814. Since this method is 
complementary, the pseudo counts can be incorporated and then an MLE can be utilized 
to determine estimates 816 and the method 800 continues. If, however, prior on counts is 

20 not desired 812, a determination is made as to whether a prior on measures of association 
is desired 818. If not, the flow ends 830. However, if yes, a determination is made as to 
whether Weight of Evidence is desired 820. If yes, Bayesian techniques are employed as 
described supra to give prior distributions based on weights of evidence 826, ending the 
. flow 830. If weights of evidence are not desired 820, a determination is made as to 

25 whether employing an error method is desired 822. If yes, an asymptotic standard error 
is determined for an MLE utiUzing methods of the present invention and/or classical 
statistical methods described supra 828. If it is not desired to employ error techniques 
822, a prior probabiUty distribution is utilized with regard to a subset of measures of 
association 824, ending the flow 830. 

30 Referring to FIG. 9, a flow diagram of a method 900 for scoring higher-order item 

sets for collaborative filtering in accordance with an aspect of the present invention is 



16 



MS180583.1 



shown. The method 900 starts 902 by determining if multiple rules exist that imply an 
item/piece of data 904. Examples of higher order item rules are given 5wpra. Typically, 
if the item/piece of data is not impUed by multiple rules, its original computed measure of 
association score is maintained, ending the flow 908. However, if the item/piece of data 
5 is implicated in multiple rules, a maximum value is determined out of scores based on the 
impUcating rules 906, ending the flow 908. 

In order to provide additional context for implementing various aspects of the 
present invention, FIG. 10 and the following discussion are intended to provide a brief, 
general description of a suitable computing enviromnent 1000 in which the various 

10 aspects of the present invention may be implemented. While the invention has been 

described above in the general context of computer-executable instructions of a computer 
program that runs on a local computer and/or remote computer, those skilled in the art 
will recognize that the invention also may be implemented in combination with other 
program modules. Generally, program modules include routines, programs, components, 

15 data structures, etc. that perform particular tasks and/or implement particular abstract 
data types. Moreover, those skilled in the art will appreciate that the inventive methods 
may be practiced with other computer system configurations, including single-processor 
or multi-processor computer systems, minicomputers, mainframe computers, as well as 
personal computers, hand-held computing devices, microprocessor-based and/or 

20 programmable consumer electronics, and the like, each of which may operatively 
communicate with one or more associated devices. The illustrated aspects of the 
invention may also be practiced in distributed computing environments where certain 
tasks are performed by remote processing devices that are linked through a 
communications network. However, some, if not all, aspects of the invention may be 

25 practiced on stand-alone computers. In a distributed computing environment, program 
modules may be located in local and/or remote memory storage devices. 

As used in this application, the term "component" is intended to refer to a 
computer-related entity, either hardware, a combination of hardware and software, 
software, or software in execution. For example, a component may be, but is not limited 

30 to, a process running on a processor, a processor, an object, an executable, a thread of 
execution, a program, and a computer. By way of illustration, an application running on 
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a server and/or the server can be a component. In addition, a component may include 
one or more subcomponents. 

With reference to FIG. 10, an exemplary system environment 1000 for 
implementing the various aspects of the invention includes a conventional computer 
5 ^ 1002, including a processing unit 1004, a system memory 1006, and a system bus 1008 
that couples various system components, including the system memory, to the processing 
unit 1004. The processing unit 1004 may be any commercially available or proprietary 
processor. In addition, the processing xmit may be implemented as multi-processor 
formed of more than one processor, such as may be connected in parallel. 

10 The system bus 1008 may be any of several types of bus structure including a 

memory bus or memory controller, a peripheral bus, and a local bus using any of a 
variety of conventional bus architectures such as PCI, VESA, MicroChannel, ISA, and 
EISA, to name a few. The system memory 1006 includes read only memory (ROM) 
1010 and random access memory (RAM) 1012. A basic input/output system (BIOS) 

15 1014, containing the basic routines that help to transfer information between elements 
within the computer 1002, such as during start-up, is stored in ROM 1010. 

The computer 1002 also may include, for example, a hard disk drive 1016, a 
magnetic disk drive 1018, e.g., to read from or write to a removable disk 1020, and an 
optical disk drive 1022, e.g., for reading from or writing to a CD-ROM disk 1024 or other 

20 optical media. The hard disk drive 1016, magnetic disk drive 1018, and optical disk drive 
1022 are connected to the system bus 1008 by a hard disk drive interface 1026, a 
magnetic disk drive interface 1028, and an optical drive interface 1030, respectively. The 
drives 1016-1022 and their associated computer-readable media provide nonvolatile 
storage of data, data structures, computer-executable instructions, etc, for the computer 

25 1002. Although the description of computer-readable media above refers to a hard disk, a 
removable magnetic disk and a CD, it should be appreciated by those skilled in the art 
that other types of media which are readable by a computer, such as magnetic cassettes, 
flash memory cards, digital video disks, Bernoulli cartridges, and the like, can also be 
used in the exemplary operating environment 1000, and fiulher that any such media may 

30 contain computer-executable instructions for performing the methods of the present 
invention. 
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A number of program modules may be stored in the drives 1016-1022 and RAM 
1012, including an operating system 1032, one or more application programs 1034, other 
program modules 1036, and program data 1038. The operating system 1032 may be any 
suitable operating system or combination of operating systems. By way of example, the 
5 application programs 1034 can include a collaborative filtering component that employs 
smoothing components that facilitate in providing a measure of association score in 
accordance with an aspect of the present invention. 

A user can enter commands and information into the computer 1002 through one 
or more user input devices, such as a keyboard 1040 and a pointing device (e.^., a mouse 

10 1042). Other input devices (not shown) may include a microphone, a joystick, a game 

pad, a satellite dish, wireless remote, a scanner, or the like. These and other input devices 
are often connected to the processing unit 1004 through a serial port interface 1044 that is 
coupled to the system bus 1008, but may be connected by other interfaces, such as a 
parallel port, a game port or a universal serial bus (USB). A monitor 1046 or other type 

15 of display device is also connected to the system bus 1008 via an interface, such as a 
video adapter 1048. In addition to the monitor 1046, the computer 1002 may include 
other peripheral output devices (not shown), such as speakers, printers, etc. 

It is to be appreciated that the computer 1002 can operate in a networked 
environment using logical connections to one or more remote computers 1060. The 

20 remote computer 1060 may be a workstation, a server computer, a router, a peer device or 
other conunon network node, and typically includes many or all of the elements 
described relative to the computer 1002, although, for purposes of brevity, only a 
memory storage device 1062 is illustrated in FIG. 10. The logical connections depicted 
in FIG. 10 can include a local area network (LAN) 1064 and a wide area network (WAN) 

25 1066. Such networking environments are commonplace in offices, enterprise-wide 
computer networks, intranets and the Intemet. 

When used in a LAN networking environment, for example, the computer 1002 is 
connected to the local network 1064 through a network interface or adapter 1068. When 
used in a WAN networking environment, the computer 1002 typically includes a modem 

30 (e.g., telephone, DSL, cable, etc.) 1070, or is connected to a communications server on 
the LAN, or has other means for estabUshing communications over the WAN 1066, such 
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as the Internet. The modem 1070, which can be intemal or external relative to the 
computer 1002, is connected to the system bus 1008 via the serial port interface 1044. In 
a networked environment, program modules (including application programs 1034) 
and/or program data 1038 can be stored in the remote memory storage device 1062. It 
5 will be appreciated that the network connections shown are exemplary and other means 
{e.g., wired or wireless) of establishing a communications link between the computers 
1002 and 1060 can be used when carrying out an aspect of the present invention. 

In accordance with the practices of persons skilled in the art of computer 
programming, the present invention has been described with reference to acts and 

10 symbolic representations of operations that are performed by a computer, such as the 
computer 1002 or remote computer 1060, unless otherwise indicated. Such acts and 
operations are sometimes referred to as being computer-executed. It will be appreciated 
that the acts and symboUcally represented operations include the manipulation by the 
processing unit 1004 of electrical signals representing data bits which causes a resulting 

15 transformation or reduction of the electrical signal representation, and the maintenance of 
data bits at memory locations in the memory system (including the system memory 1006, 
hard drive 1016, floppy disks 1020, CD-ROM 1024, and remote memory 1062) to 
thereby reconfigure or otherwise alter the computer system's operation, as well as other 
processing of signals. The memory locations where such data bits are maintained are 

20 physical locations that have particular electrical, magnetic, or optical properties 
corresponding to the data bits. 

FIG. 1 1 is another block diagram of a sample computing environment 1 100 with 
which the present invention can interact. The system 1 100 further illustrates a system 
that includes one or more cUent(s) 1 102. The chent(s) 1 102 can be hardware and/or 

25 software {e.g., threads, processes, computing devices). The system 1 100 also includes 
one or more server(s) 1 104. The server(s) 1 104 can also be hardware and/or software 
{e.g., threads, processes, computing devices). The servers 1 104 can house threads to 
perform transformations by employing the present invention, for example. One possible 
communication between a client 1 102 and a server 1 104 may be in the form of a data 

30 packet adapted to be transmitted between two or more computer processes. The system 
1 100 includes a communication framework 1 108 that can be employed to facihtate 
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communications between the client(s) 1 102 and the server(s) 1 104. The client(s) 1 102 
are operably connected to one or more client data store(s) 1110 that can be employed to 
store information local to the client(s) 1 102. Similarly, the server(s) 1 104 are operably 
connected to one or more server data store(s) 1 1 06 that can be employed to store 
5 information local to the servers 1 104. 

In one instance of the present invention, a data packet is transmitted between two 
or more computer components that facilitates collaborative filtering, the data packet 
comprised of, at least in part, collaborative filtering data based, at least in part, on 
collaborative filtering employing statistical smoothing techniques. 
10 In another instance of the present invention, a computer readable medium storing 

computer executable components of a system that facilitates collaborative filtering, the 
system comprised of, at least in part, a collaborative filtering component that produces 
item scoring based, at least in part, on a collaborative filtering employing statistical 
smoothing techniques. 

15 It is to be appreciated that the systems and/or methods of the present invention 

can be utiUzed in a collaborative filtering scheme for facilitating computer components 
and non-computer related components aUke. Further, those skilled in the art will 
recognize that the systems and/or methods of the present invention can be employed in a 
vast array of electronic related technologies, including, but not limited to, computers, 

20 servers, television related products, media products, search engines, business related 
products, and/or handheld electronic devices and the hke. 

What has been described above includes examples of the present invention. It is, 
of course, not possible to describe every conceivable combination of components or 
methodologies for purposes of describing the present invention, but one of ordinary skill 

25 in the art may recognize that many fiirther combinations and permutations of the present 
invention are possible. Accordingly, the present invention is intended to embrace all 
such alterations, modifications and variations that fall within the spirit and scope of the 
appended claims. Furthermore, to the extent that the term "includes" is used in either the 
detailed description or the claims, such term is intended to be inclusive in a manner 

30 similar to the term "comprising" as "comprising" is interpreted when employed as a 
transitional word in a claim. 
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